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ABSTRACT 

Alkylating agents often generate 3-methylcytosine 
(3meC) lesions that are efficiently repaired by AlkB 
homologues. If AlkB homologue proteins are not 
functional, or the number of 3meC lesions exceeds 
the cellular repair capacity, the damage will persist 
in the genome and become substrate of DNA poly- 
merases (Pols). Though alkylating agents are 
present in our environment and used in the clinics, 
currently nothing is known about the impact of 
3meC on the accuracy and efficiency of human 
Pols. Here we compared the 3meC bypass 
properties of six human Pols belonging to the 
three families: B (Pol 8), X (Pols p and k) and Y 
(Pols k, i and tj). We show that under replicative con- 
ditions 3meC impairs B-family, blocks X-family, but 
not Y-family Pols, in particular Pols r\ and i. These 
Pols successfully synthesize opposite 3meC; Pol i 
preferentially misincorporates dTTP and Pol t] 
dATP. The most efficient extenders from 3meC 
base-paired primers are Pols k and r\. Finally, using 
xeroderma pigmentosum variant patient cell 
extracts, we provide evidence that the presence of 
functional Pol ij is mandatory to efficiently overcome 
3meC by mediating complete bypass or extension. 
Our data suggest that Pol tj is crucial for efficient 
3meC bypass. 

INTRODUCTION 

Living organisms are exposed to alkylating agents present 
in shape of endogenous sources, such as cellular metabol- 
ism, exogenously in the environment, and in clinics as one 
of the most frequently used cancer chemotherapeutic 
drugs (1,2). Exposure of genomic DNA to alkylating 



agents often results in cytotoxic and/or carcinogenic mu- 
tations (3). Consequently, alkylation chemotherapy was 
shown to potentially induce secondary cancers 
characterized by genome instability and mutation accu- 
mulation (4). Alkylation DNA lesions, induced by S N 2 
alkylating agents, include 3-methylcytosine (3meC), 
3-methylthymine, 1-methyladenine and 1-methylguanine 
(5). 3meC lesions are predominantly generated in single- 
stranded (ss) DNA during replication, repair and tran- 
scription. In double-stranded (ds) DNA, the reactive site 
is involved in base pairing and thus protected from modi- 
fication (6). The 3meC lesions are efficiently repaired via 
direct oxidative reversal, mediated by iron(II) and 
2ketoglutarate (2KG)dependent dioxygenases: alkylation 
protein B (AlkB) in prokaryotes (7) and the AlkB homo- 
logues (ALKBH) 2 and 3 in eukaryotes (8,9). AlkB defi- 
cient Escherichia coli cells have elevated amount of 
alkylated lesions, display increased mutation frequency 
and are hypersensitive to alkylating agents (10,11). 
Similarly, exposure of mammalian cells lacking 
ALKBH2 to the alkylating agent methylmethane sulfon- 
ate (MMS) results in increased cytotoxicity and 
mutagenicity (12,13). Compared with ALKBH2 profi- 
cient, the deficient cells are characterized with large 
increase in C^-A and to certain extent C^-T mutations, 
when treated with MMS (13). Additionally, ALKBH2 has 
been implicated in cancerogenesis, by influencing cancer 
progression, as well as sensitivity to cisplatin treatment 
(14,15). Efficient removal of alkylation DNA damage is 
thus critical for cellular function, preservation of genomic 
integrity and survival. 

Increased cytotoxicity and mutation frequency, due to 
loss of functional AlkB or ALKBH2, suggest an impaired 
action of DNA polymerases (Pols) at the site of 
unrepaired alkylation damage. Besides a malfunctional 
direct reversal pathway, Pols can face 3meC in case: (i) 
when the amount of DNA damage exceeds the cellular 
repair capacity as well as (ii) if alkylation damage of 
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ssDNA occurs during DNA replication or repair. Upon 
encountering the DNA lesion, Pols can either (i) be com- 
pletely blocked, (ii) incorporate a nucleotide opposite the 
damage and then be blocked due to inability to extend or 
(iii) fully bypass by base pairing and extending past the 
lesion. Understanding the role(s) of the different Pols in 
the bypass of alkylation damage is complicated by the 
number of different Pols present in every organism, with 
a broad range of catalytic properties. For example, the 
prokaryotic organism E. coli contains five different Pols 
(16), whereas in eukaryotes sixteen Pols have been 
identified and, based on the sequence homology, 
grouped in five families — A, B, X, Y and reverse tran- 
scriptase (17). Addition of a methyl group to endocyclic 
N-atoms and subsequent generation of 3meC, 3- 
methylthymine, 1-methyladenine and 1-methylguanine 
lesions leads to DNA replication block in E. coli, poten- 
tially due to impairment of B-family replicative Pol III 
(10). In addition, the E. coli A-family gap-filling Pol I is 
inhibited by 3meC damage and besides certain incorpor- 
ation of dGTP, it can misincorporate dATP and dTTP 
opposite the lesion (18,19). While the replicative Pols are 
characterized with a high fidelity enabled by a tight-fitting 
active site and 3'— >■ 5' proofreading activity, the active sites 
of X- and Y-family translesion synthesis (TLS) Pols, 
which also lack a 3'— >■ 5' proofreading exonuclease 
activity, can readily accommodate and efficiently bypass 
different DNA damages. At the same time, this results in 
an increased error-rate on undamaged DNA (20-22). 
Accumulation of the Y-family TLS Pols, as result of 
SOS-response activation in E. coli, increases bypass effi- 
ciency of the 3-methylpirimidine and 1-methylpurine 
lesions by 3^1-fold (10,11). A recent structural study 
provided valuable insight into the features of prokaryotic 
Y-family Dpo4-mediated bypass of 3meC lesion (23). 
Presence of 3meC results in inhibition of Dpo4 and gen- 
eration of ~60% mutagenic products during remaining 
bypass (23). Crystal structure of Dpo4 ternary extension 
complex with correct and mismatched 3'-terminal primer 
base opposite the 3meC lesion indicates that the lesion 
remains positioned within the DNA template/primer 
helix, whereas the correct as well as incorrect nucleotides 
paired to 3meC are misaligned and exhibit significant 
primer terminal distortion (23). Unfortunately, similar 
structural or mechanistic studies in the presence of the 
blocking and mutagenic 3meC lesion with eukaryotic 
Pols are currently lacking. 

In this work, we characterized the impact of the 3meC 
lesion on six human Pols belonging to the three families B 
(Pol §), X (Pols p and X) and Y (Pols k, i and n). The 
properties of human Pols were compared on (i) running- 
and standing-start templates mimicking replicative condi- 
tions, as well as (ii) 29-, 7- and 1 -nucleotide (nt) gap con- 
structs resembling nucleotide excision (NER) and base 
excision repair (BER) intermediates, respectively. By 
using recombinant proteins, we show that under replica- 
tive conditions, 3meC impairs human B-family, represents 
a block for X-family enzymes, but can be successfully 
utilized by Y-family Pols, especially Pols i and r\. Single 
nucleotide incorporation experiments reveal that Pols i 
and r| successfully base-pair 3meC, Pol i by preferentially 



misincorporating dTTP and Pol r| dATP. The most effi- 
cient extenders of base-paired 3meC are Pols k and r\, 
their activity depending on the DNA structure. Finally, 
by using human cell extracts, we provide evidence that 
in the context of the cell the Pol crucial for efficient 
3meC bypass is Pol r|, as its lack results in an inability 
to successfully overcome the lesion. These results suggest 
the existence of a novel Pol ^-dependent pathway that 
could reduce the mutagenic effects of alkylation treatment 
and potentially facilitate cellular survival, especially under 
conditions when the functional ALKBH proteins, respon- 
sible for direct reversal repair, are missing. 

MATERIALS AND METHODS 

Labeled y[ 32 P] ATP was purchased from Hartmann 
Analytic. Deoxyribonucleoside triphosphates were from 
Sigma. All other reagents were of analytic grade and 
purchased from Fluka, Sigma or Merck. The 40, 39, 35, 
32, 26 and 17mer and oligonucleotides were obtained from 
Microsynth, whereas 72mer containing 3meC lesion or 
undamaged C were purchased were from Purimex. 

Preparation of DNA substrates 

The sequences of the oligonucleotides used were as 
follows: 72mer template, 5'GTATTAGATATTCGGGA 
GGTTGGGCGCCGGCGXTGTGAATTCGGCACTG 
GCCGTCGTATGCTCTTGGTTGTA3' ; 40mer primer, 
5'TACAACCAAGAGCATACGACGGCCAGTGCCG 
AATTCACAN3'; 39mer primer, 5'TACAACCAAGAGC 
ATACGACGGCCAGTGCCGAATTCACA3'; 17mer 
primer, 5'TACAACCAAGAGC ATAC3' ; 35mer 5' AAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
3'; 32mer 5'CGCCGGCGCCCAACCTCCCGAATATC 
TAATAC3'; and 26mer, 5'CGCCCAACCTCCCGAAT 
ATCTAATAC3' . The letter X corresponds to the 
position of 3meC and C in the damaged and undamaged 
DNA template, respectively. The letter N in 40mer indi- 
cates four possible nucleotides at the primer 3' terminus: 
G, A, T or C. These four different 40mers were annealed 
to template strand and used in the extension assay experi- 
ments (Figure 5 and Supplementary Figure S6). The 
primer/template resembling running-start conditions was 
generated by annealing 1 7mer primer and 72mer template 
(Figure 1 and Supplementary Figure SI), whereas anneal- 
ing of 39mer primer and the same template resulted in 
standing-start situation (Figures 2, 6 and Supplementary 
Figure S2). The 29-nt gap construct was created by an- 
nealing 17mer primer and 26mer downstream oligonucleo- 
tide to 72mer template (Figure 3 and Supplementary 
Figure S3). Annealing of 39mer primer and downstream 
32mer to 72mer template yielded 1-nt gap construct 
(Figure 4). When instead of 32mer a 26mer was used, 7- 
nt gap was generated (Supplementary Figures S4 and S5). 

The 17, 39 and 40mer primers were labeled with T4 
polynucleotide kinase (New England Biolabs) in the 
presence of [ 32 P] ATP. Each labeled oligonucleotide was 
mixed with the 72mer template oligonucletide at 1:1 
(M/M) ratio in the presence of 20 mM Tris-HCl 
(pH 7.4) and 150 mM NaCl, heated at 95°C for 10 min 
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and then slowly cooled down at room temperature. 
Similarly, to obtain 29-, 7- and 1-nt gap constructs, 
labeled primer, template and downstream oligonucleotide 
were mixed in ration 1:1:1 (M/M/M). 

Expression and purification of recombinant proteins 

Human recombinant Pol 8 and PCNA were expressed 
and purified as described in (24,25). Human recombinant 
Pols X and |3 were purified as noted previously in (26). The 
bacterial clones for the expression of human Pols n and i 
were a gift from Roger Woodgate (NIH, USA). Human 
recombinant Pols ^ and i were expressed and purified as 
described (27). Pol k was purchased from Enzymax, USA. 

Cell lines and culture conditions 

XP30R0 fibroblasts are simian virus 40-transformed cells 
derived from a patient suffering from the variant form of 
xeroderma pigmentosum (XPV). The cells harbor a 13-bp 
deletion resulting in the expression of a truncated and 
non-functional Pol n. XP30R0 cell line was a gift from 
Alan Lehman (University of Sussex, Brighton, UK). Cells 
were grown in a humidified 5% C02 atmosphere in 
Dulbecco's modified Eagle's medium containing 
GlutaMAX-I supplemented with 10% fetal bovine 
serum and lOOU/ml penicillin-streptomycin (all 
obtained from Gibco, Invitrogen). To overexpress 
human Pols r\ and i mammalian expression vectors 
pJRM56 and pJRM46 [provided by Roger Woodgate 
(NIH, USA)] were used. Empty vector pcDNA 3.1 
(Invitrogen), pJRM46 or pJRM56 expression constructs 
were transiently transfected into XPV cells using 
lipofectamine (Invitrogen) according to manufacturers 
protocol. Forty-eight hours after the transfection, cells 
were harvested and whole-cell extract (WCE) prepared. 

Preparation of WCE 

Cells were trypsinized, washed twice with cold PBS and 
resuspended in three packed cell volumes of hypotonic 
lysis buffer [lOmM Tris-HCl (pH 8), ImM EDTA, 
10%(w/v) sucrose, 10% (v/v) glycerol, 0.1 mM PMSF, 
1 mM DTT, 1 ug/ml pepstatine, 1 ug/ml bestatine and 
1 (ig/ml leupeptin]. After swelling for 20min at 4°C, cells 
were lysed with a Dounce homogenizer (tight pastille). 
NaCl was added next to a final concentration 0.35 M 
and incubated lOmin at 4°C. Samples were diluted with 
seven packed cell volumes of lysis buffer, sonicated and 
centrifuged 20min at 15 000^. The supernatants were flash 
frozen in liquid N 2 or analyzed directly. 

Protein analysis 

WCEs were separated on 10% Bis-Tris poly-acrylamide 
gels followed by transfer to Immobilon-FL membrane 
(Millipore) for immunoblotting analysis. Primary 
antibodies against Pol r\ (Abeam), Pol i [custom made 
antibody, provided by Roger Woodgate (NIH, USA)] 
and Tubulin (Sigma) were detected using infrared Dye- 
conjugated secondary antibodies (LI-COR Biosciences). 
The signal was visualized using direct infrared fluores- 
cence via the Odyssey Scanner (LI-COR Biosciences). 



Enzymatic assays 

Recombinant human enzyme assays 

For denaturing gel analysis of DNA synthesis products, 
the reaction mixture (10 ul) contained 40mM Tris-HCl 
(pH 8), lOmM DTT and 250 ug/ml bovine serum 
albumin. In case of Pol 8 reactions, 5mM Mg 2+ was 
present, whereas for all other Pols, 1 mM Mg~ + was 
used. Concentrations of Pols and the 5' 32 P-labeled 
DNA in the reaction were as indicated in the Figures, as 
well as Figure Legends. The amounts of dNTPs were as 
follows: 10 uM for Pol 8, 1 uM in case of Pol X and 5 uM 
for all other Pols. All reactions were performed at 37°C 
for 15 min, and the reaction products analyzed by 7M 
urea/12.5% polyacrylamide gel electrophoresis. 

Assay with WCEs 

For denaturing gel analysis of DNA synthesis products, 
the 10 ul of reactions containing 40 mM Tris-HCl (pH 8), 
lOmM DTT and 250 ug/ml bovine serum albumin, ImM 
Mg 2+ , 2nM 5' 32 P-labeled DNA, 10 uM unlabeled 35mer 
trap DNA, 100 uM dNTPs and 3ul of (1.33ug/ul) WCE 
were incubated at 37°C for 30 min, and reaction products 
were separated by 7M urea/12.5% polyacrylamide gel 
electrophoresis. 

Steady state kinetic analysis 

Reactions were performed as described earlier in the text, 
each reaction containing 20 nM of DNA template and 
15nM of either Pol i or r|. Quantification was performed 
by densitometry with a Phosphorlmager (Typhoon Trio, 
GE Healthcare). The initial velocities of the reaction were 
calculated from the values of integrated gel band 
intensities with the programs GelEval 1.35 scientific 
imaging software (FrogDance Software, UK) and 
GraphPad Prism 5.0: 

PT/IT- 1, 

where T is the target site, the template position of interest; 
PTis the sum of the integrated intensities at positions T, 
T+ 1,. . ., T+n. All of the intensity values were normalized 
to the total intensity of the corresponding lane to correct 
for differences in gel loading. The apparent k M and k cal 
values were calculated by plotting the initial velocities in 
dependence of the nucleotide [dNTP] or primer [3'-OH] 
substrate concentrations and fitting the data according 
to Michaelis-Menten equation: 

k cat [E\ 0 /(l+k M /[S\), 

where [E] Q , was the input enzyme concentration and 
[S] was the variable substrate. Substrate incorporation 
efficiencies were defined as the k cat /k M ratio. Under 
single nucleotide incorporation conditions, 
kcat = k pol k off /(k po i + k off ) and k M = k d k off /(k pol + k off ), 
with kp 0 i being the true polymerization rate, k off the dis- 
sociation rate of the enzyme-primer complex and k d the 
true Michelis constant for nucleotide binding. Thus, 
kcat/kM values are equal to k pol /k d . Substrate concentra- 
tions used for dNTPs were 0.1-10 uM. 
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Time-dependent accumulation of products was fitted to 
the mixed exponential equation: 

[P] = A(l -e- k ^') + k ss t, 

where [P] designates the products formed, A the burst 
amplitude, t time, k ohs the apparent burst rate, and k ss 
the steady-state rate, which corresponds to k 0 fj. 

Free energy change (AAG) values were estimated 
according to the relationship: 

AAG = RT \n(k), 

where R is the gas constant (1.9872 cal x mol -1 x K -1 ), T 
is the absolute temperature (K), and k is the ratio between 
the kinetic constants considered. 

RESULTS 

The Y-family DNA polymerase n successfully 
bypasses 3meC 

3meC is predominantly generated in ssDNA, which is 
largely present during DNA replication. To address the 
impact of 3meC damage occurring directly in front of 
the replication machinery, bypass efficiency of six 
human Pols was tested under running-start conditions 
(Figure 1A), using a primer/template containing the 
lesion at position +23 from the primer. For efficient 
lesion bypass, Pols need to elongate the primer up to the 
lesion, incorporate a nucleotide opposite the damage and 
extend from it. The activity of the major lagging-strand 
replicative Pol 8 was partially impaired in the presence 
of 3meC damage (Figure IB). Pol 8 reached the lesion 
and incorporated a nucleotide opposite 3meC; however, 
its further extension was reduced resulting in very limited 
amount of full-length product (Figure IB, lanes 2-A). This 
suggested that, upon encountering the lesion, the replica- 
tion fork could potentially be significantly slowed-down or 
stalled and for its successful continuation another more 
specialized Pol would be required. The repair X-family 
Pols (3 and k are known to bypass small base lesions 
(17,20), but they were fully blocked by the 3meC 
damage (Figure IB, lanes 6-12). The three different 
members of TLS Y-family Pols k, i and r| were tested 
next on 3meC (Figure 1C). As a template DNA in its 
sequence contains all four bases including undamaged T, 
which Pol i favorably mispairs with 'wooble' G (28), Pol i 
was unable to efficiently elongate the running-start primer 
and reach the lesion (Figure 1C, lanes 6-8). Pol k in con- 
trast very efficiently elongated the primer, but exhibited 
strong pausing at the position immediately preceding the 
lesion. It nonetheless was able to partially bypass 3meC, 
by generating products one or two nucleotides shorter 
than the full-length (Figure 1C, lanes 2-4). This suggested 
that Pol k either loops out 3meC lesion and continues the 
synthesis or generates deletions during elongation after 
pairing of 3meC. To distinguish between the two, the 
length of Pol k products was compared in the presence 
of undamaged and damaged running-start templates 
(Supplementary Figure SI A). In both cases, the products 
generated by Pol k were one or two nucleotides shorter 



than the full-length (Supplementary Figure SIB, compare 
lanes 1-8 with lanes 9 and 10), thus indicating that under 
tested conditions, Pol k most likely induces deletions 
during elongation. Pol r\ finally was the most efficient in 
bypassing 3meC under running-start conditions. Pol i"| in 
fact elongated the primer and efficiently generated full- 
length product, showing only a modest pausing at the 
lesion site (Figure 1C, lanes 11 and 12). Taken together, 
these first results indicated that under running-start con- 
ditions, 3meC (i) blocks human X-family Pols, (ii) impairs 
the activity of B-family Pol 8 and Y-family Pol k, but (hi) 
can successfully be processed by TLS Pol r\. 

DNA polymerases i and r\ efficiently, but inaccurately, 
base-pair 3meC 

While considering the bypass efficiency of a certain DNA 
lesion, it is additionally essential to determine its impact 
on Pol fidelity. To assess the accuracy of 3meC bypass, the 
primer was annealed to template strand directly in front 
of the lesion resulting in standing-start conditions 
(Figure 2A). The amount of dNTPs in the reaction was 
modulated according to the known affinity of each Pol. 
Replicative Pol 8 under standing-start conditions prefer- 
entially misincorporated dATP, followed by dTTP 
opposite 3meC (Figure 2B, lanes 4 and 5). Additionally, 
in the presence of all 4 dNTPs, Pol 8 incorporated a single 
nucleotide opposite the lesion and in comparison with 
control template exhibited dramatically reduced extension 
capacity (Figure 2B, lane 2; Supplementary Figure S2), 
thus confirming the observation from Figure IB. The 
presence of 3meC on standing-start primer/template 
strongly inhibited the activity of Pols p and k, as well as 
k (Figure 2C-E). Interestingly, Pol k weakly inserted two 
dCTPs (Figure 2E, lane 6), suggesting that under 
standing-start conditions, this Pol either (i) potentially 
skips 3meC and copies the next template base (G) twice 
or (ii) incorporates first dCTP opposite the lesion and the 
second one across subsequent template base. In compari- 
son with other human Pols tested, the Y-family Pols i and 
r] efficiently base paired the lesion (Figure 2F and G). 
Interestingly, Pol i more efficiently incorporated dGTP 
opposite 3meC, than across undamaged C (Figure 2F, 
compare lane 1 with lane 3). While Pol i successfully 
mispaired 3meC with dTTP, Pol i] with comparable effi- 
ciency incorporated the inaccurate dATP as well as the 
correct dGTP opposite 3meC (Figure 2F and G; 
Table 1). In addition, it also showed a low discrimination 
against the incorporation of dTTP and dCTP (Table 1). 
Similar to Pol k, in the presence of dCTP incorporation of 
two nucleotides was observed (Figure 2G, lane 6). 
Suggesting that Pol r\ after copying the lesion incorporates 
second dCTP opposite the next template base or that it 
skips 3meC and copies subsequent template G twice. 
Overall, the substrate affinity and conversion efficiency 
were several fold higher for Pol r| than i (Table 1). Pol 
r| further completed the bypass by generating full-length 
products in the presence of all 4 dNTPs (Figure 2G, lane 
2). These findings suggested that Y-family Pols i and r) 
efficiently misincorporate nucleotides opposite 3meC, with 
Pol r) being the more accurate of the two. 
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Figure 1. 3meC bypass by six human DNA polymerases under running-start conditions. (A) Schematic representation of running-start 
primer template containing 3meC lesion designated as X. Numbers above the template indicate the positions of template bases with respect 
to the lesion. FL stands for full-length. (B, C) Titration of Pols S, p, X (B) and Pols k, i, r| (C) under running-start conditions in 
the presence of 2nM DNA and indicated PCNA amounts. The experiments were done under conditions as described in 'Materials and Methods' 
section. 



DNA polymerase t) is the most proficient in filling 3meC 
containing DNA gaps, mimicking NER and BER 
intermediates 

In addition to DNA replication, ssDNA regions fre- 
quently exist in the shape of gap intermediates produced 
during repair processes such as NER and BER. 
Generation of alkylation damage while the NER or 
BER machineries are still processing these ssDNA gaps 
would require lesion bypass for efficient gap closure. To 
determine the ability of Pols to close NER gap intermedi- 
ate containing a lesion, a 29-nt gap construct with 3meC at 
position +23 was generated. As the presence of a phos- 
phate group can dramatically affect bypass efficiency of 
certain Pols (29), the 5'-terminus of the gap was either 
phosphorylated (Figure 3A) or not (Supplementary 



Figure S3A). As expected, titration of Pol 5 revealed 
that this Pol efficiently reached the 3meC, was able to 
incorporate a nucleotide opposite the lesion, but was 
poor in subsequent extension, resulting in inefficient gap 
filling independently of the gap 5'-end phosphorylation 
status (Figure 3B and Supplementary Figure S3B). 
While Pol (3 was once more blocked by 3meC, Pol A, 
another X-family member, efficiently closed the 29-nt 
gap, when the 5'-end of the gap was phosphorylated, but 
not when the phosphate group was lacking (Figure 3B, 
lane 12 compare with Supplementary Figure S3B, lane 
12). Interestingly, Pol X predominately generated 
products one nucleotide shorter than the gap size 
(Figure 3B, lane 12), indicating that it either stops just 
before the 5'-end of the gap or it generates single 
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Figure 2. Efficiency and fidelity of six human DNA polymerases on the 3meC template under standing-start conditions. (A) Standing-start primer 
template with undamaged C or 3 meC lesion designated as X. Numbers above the template indicate the position of template bases with respect to the 
lesion at +1. FL is full-length product. (B-G) Extension of the 2nM standing-start primer template by 2nM Pol S in the presence of lOOnM PCNA 
(B), 2nM Pol p (C), 2nM Pol A. (D), 2nM Pol k (E), 2nM Pol i (F) and 2nM Pol n (G). Reactions in lanes 1 and 3-6 contain dGTP, dATP, dTTP 
and dCTP, respectively. In lane 2, all four dNTPs (N) were added. The experiments were performed under conditions as specified in 'Materials and 
Methods' section. 



nucleotide deletions, potentially by looping out the lesion. 
Similarly to running start conditions, Pol i was not able to 
reach the lesion (Figure 3C and Supplementary Figure 
S3C). Pol k showed a strong pausing site immediately 
before the lesion but was eventually able to close the 



gap containing both non- and phosphorylated 5'-end 
(Figure 3C and Supplementary Figure S3C), as well as 
continuing up to the end of the template, displacing the 
downstream strand. Pol r\ on the other hand efficiently 
bypassed the lesion and completely filled the gap 
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Table 1. Kinetic parameters for nucleotide incorporation by DNA polymerases i and r\ opposite 3meC in the template 
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Pol T] 




k M (uM) 
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Wk M (M-V) 


Fidelity" 


k M (uM) 
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Fidelity 11 


dGTP 


6 


0.3 


0.05 x 10 3 




2 


0.6 


0.31 x 10 3 




dATP 


n.a. 


n.a. 


n.a. 


n.a. 


1.5 


0.7 


0.46 x 10 3 


0.7 


dTTP 


12.64 


0.9 


0.07 x 10 3 


0.7 


18.4 


0.7 


0.04 x 10 3 


8.9 


dCTP 


3.5 


0.2 


0.05 x 10 3 
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(Figure 3C and Supplementary Figure S3C). It also 
showed robust strand displacement of both 
phosphorylated and non-phosphorylated 5'-gap oligo- 
nucleotide (Figure 3C and Supplementary Figure S3C). 



Next, the efficiency and accuracy of 3meC bypass were 
addressed on a shorter 7-nt gap template, mimicking 
a long-patch BER intermediate, with phosphorylated 
(Supplementary Figure S4A) or non-phosphorylated 
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(Supplementary Figure S5A) 5'-gap terminus and bearing 
the lesion at position +1. B-family Pol 8 was impaired by 
3meC lesion, irrespectively of the 5'-gap status and only 
weakly misincorporated dATP and dTTP, being then 
unable to continue synthesis (Supplementary Figures 
S4B and S5B). Even though Pol (3 is suited to fill short 
gaps, the presence of 3meC entirely blocked its activity 
(Supplementary Figures S4C and S5C). Pol k was com- 
pletely blocked with 5'-phosphorylated gap 
(Supplementary Figure S4E) and showed only a very 
weak activity on the unphosphorylated substrate 
(Supplementary Figure S5E). In contrast, Pol r| again ef- 
ficiently closed the gap, incorporating mainly the incorrect 
dATP and correct dGTP opposite the lesion, as well as to 
a lesser extent dCTP and dTTP (Supplementary Figures 
S4G and S5G). Pols X and i mispaired 3meC with dCTP 
(Supplementary Figure S4D, lane 6) and dTTP 
(Supplementary Figure S4F, lane 5 and Supplementary 
Figure S5F, lane 5), respectively. In comparison to 
standing-start conditions (Figure 2F), the presence of 
3meC did not stimulate Pol i activity. While Pol i was 
unable to close the gap (Supplementary Figures S4F and 
S5F, lane 2), Pol X after dCTP misincorporation reached 
the end of the gap, albeit with low efficiency and only 
when 5' gap terminus was phosphorylated (compare 
Supplementary Figure S4D, lanes 2 and 6 to 
Supplementary Figure S5D, lanes 2 and 6). As under 
standing-start conditions Pol X could not bypass 3meC 
(Figure 2D), the present observation raised the possibility 
that in the context of short-gap 3meC is not directly 
mispaired with dCTP, but instead that Pol X uses its 
ability to scrunch the template (30) and loop out the 
lesion, thus incorporating dCTP opposite template G at 
position +1. However, it is also possible that the presence 
of the 5'-phosphate, which is known to increase the effi- 
ciency of Pol X, may allow direct incorporation opposite 
the lesion. 

DNA polymerases i and n can fill a single-nucleotide gap 
positioned opposite 3meC 

While long-patch BER is more common during replication, 
the majority of BER reactions in G1/G2 phases give raise 
to ssDNA in shape of a 1-nt gap intermediate, as a conse- 
quence of the combined glycosylase and AP endonuclease 
action. Thus, we next tested the fidelity on a template con- 
taining a 1-nt gap positioned opposite 3meC (Figure 4A) 
and bearing a 5'-phosphorylated end. Replicative Pol 8 was 
as expected impaired at this template and extremely poorly 
misincorporated dATP and dTTP opposite the lesion 
(Figure 4B). Pol p was unable to act at the site of 3meC 
damage, even under these classical BER conditions 
(Figure 4C). Surprisingly, Pol X that is known to efficiently 
perform TLS on 1-nt gap (26,31) was blocked in the 
presence of 3meC. This suggested that during copying of 
the larger 7- and 29-nt gaps (Figure 3 and Supplementary 
Figure S4) Pol X scrunched the template, looped out the 
damaged nucleotide, elongated and consequently induced 
single nucleotide deletions. From the Y-family enzymes, 
Pol k did not act on the lesion template (Figure 4E), 
whereas Pol i and i"| efficiently filled the gap. Pol i 



misincorporated dTTP (Figure 4F, lane 5) and its 
activity, similarly to 7-nt gap (Supplementary Figures S4 
and S5), was not stimulated by 3meC. Thus, suggesting that 
presence of the lesion stimulates Pol i activity only in rep- 
licative context (Figure 2F). Pol r| on 1-nt gap template 
incorporated the correct dGTP, as well as the incorrect 
dATP (Figure 4G, lanes 3 and 4). Interestingly, on this 
template, no incorporation of dTTP or dCTP by Pol r) 
was noted, contrary to what was observed under 
standing-start conditions and with the longer gaps. 
Further implying that the previously detected dCTP and 
dTTP incorporation by Pol r] (Figures 2G, 
Supplementary Figure S4G and S5G), as well as dCTP 
by Pol k (Figure 2E) resulted from incorporation 
opposite alternative templating bases, which cannot take 
place in a 1-nt gap. When all 4 dNTPs were present in 
the reaction Pol i"| generated a +1 product with electrophor- 
etic mobility more similar to that of dATP than dGTP, 
indicating potentially inaccurate bypass of 3meC in a 1-nt 
gap (Figure 4G, compare lanes 2, 3 and 4). Taken together, 
as in the case of the previously used standing-start and 
gapped templates, Pols i and r\ were the most proficient 
Pols in performing TLS over a 3meC lesions. 

DNA polymerases k and r| are best extenders from 3meC 

In many cases, TLS is a two-step process involving incorp- 
oration of a nucleotide opposite a lesion by a certain Pol 
and subsequent extension of the primer by a specialized 
extender Pol. To test the extension properties of different 
Pols, a primer/template ending with either the correct 
G:3meC or the three incorrect A:3meC, T:3meC or 
C:3meC base pairs was created (Figure 5A). All six 
tested human Pols successfully utilized the control C:G 
substrate, however with different efficiencies (Figure 5B, 
lanes 2-7). In presence of the lesion, Pol 8 only weakly 
extended T:3meC and C:3meC containing constructs 
(Figure 5B, lanes 23 and 30). In contrast, Pols k and r\ 
utilized both correctly, as well as all three incorrectly 
primed lesion templates (Figure 5B, lanes 12, 14, 19, 21, 
26, 28, 33 and 35) and from all the six Pols tested were the 
two most efficient in extending all the substrates. While 
Pol r| generated full-length products, Pol k in contrary 
created products one or two nucleotides shorter than the 
full-length (Figure 5B, lanes 12,19, 26 and 33), in agree- 
ment with its tendency to introduce —1 deletions (32). 

To address whether the observed extension capacity was 
construct dependent, we next generated a 6-nt gap with 
either correct G:3meC or incorrect A:3meC, T:3meC and 
C:3meC base pair located at the 3'-end of the gap 
(Supplementary Figure S6A). From these substrates, Pol 
8 only poorly processed A:3meC, C:3meC and T:3meC 
gaps (Supplementary Figure S6B, lanes 16, 23 and 30). 
Pol X on the other hand was able to extend all lesion con- 
structs and close 6-nt gap (Supplementary Figure S6B). 
Similarly, Pols k and r) acted on all the templates and, as 
with the previous substrates, were the most efficient exten- 
ders. Interestingly, under the 6-nt gap condition, Pol k was 
the most active especially if gap contained T:3meC or 
C:3meC (Supplementary Figure S6B, lanes 26 and 33). 
Pol r|, as well as Pol k but only on the T:3meC substrate. 



Nucleic Acids Research, 2014, Vol. 42, No. 1 561 



5' CACA C G C 3' 

3' GTGTXGCG 5' 



X = C or 3meC 



B 



X = 3meC 



X = 3meC 



X = 3meC 




Lane 12 3 4 5 
dNTP(10u,M) G N G A T C 
Pol 5 + PCNA 



Lane 12 3 4 
dNTP(5u.M) G N G A 
Pol p 



5 6 
T C 




+1 — 



Lane 1 2 3 4 5 6 
dNTP(1[xM) G N G A T C 
Pol X 



X = 3meC 




+1- 



Lane 12 3 4 
dNTP(5|iM) G N G A 
Pol K 



5 6 
T C 



o 
ii 

X 



X = 3meC 




+1 



Lane 1 2 3 4 5 6 
dNTP(5nM) G N G A T C 
Pol i 



o 
ii 

X 



X = 3meC 




+ 1 



Lane 1 2 3 4 5 6 
dNTP(5(xM) G N G A T C 
Pol ti 
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displayed a robust strand displacement activity. In case of a 
C:3meC template, both Pols X and k predominantly 
generated five nucleotides long product, a patch one nu- 
cleotide shorter than the gap itself, thus indicating that 
both Pols potentially re-prime the template by repositioning 
the 3' C and base-pairing it with G at position +1 from the 
lesion. In summary, these findings indicated that Pols k and 
r| are the most efficient extenders of 3meC, and that their 
activity is influenced by the structure of the DNA construct. 

Efficient bypass of 3meC in human WCEs requires the 
presence of a functional DNA polymerase r\ 

So far in the experiments testing recombinant human 
enzymes, we demonstrated that Pol t successfully 
incorporated a nucleotide opposite 3meC, whereas Pol rj 
was more efficient in extending a mismatched primer con- 
taining 3meC. Uniquely, among all Pols tested, Pol r\ was 
efficient in both: (i) incorporation opposite the lesion as 
well as (ii) subsequent extension (Figures 1-5). To verify 



whether Pol r\ was indeed playing a major role in 3meC 
bypass, even in the presence of other Pols such as in the 
context of a whole cell, we tested the fidelity and bypass 
capacity of various cellular extracts under standing-start 
conditions (Figure 6A). WCEs were prepared from XPV 
patient cells lacking endogenous functional Pol r\. XPV 
cells were complemented with either (i) empty vector, (ii) 
human Pol i] or (hi) Pol i expression constructs 
(Figure 6B). Interestingly, although all three WCEs were 
comparably active on a control template containing un- 
damaged C (Figure 6C, lanes 1, 7 and 13), only WCE 
prepared from XPV cells that had been complemented 
with Pol T) showed robust incorporation opposite 3meC 
(Figure 6C, lanes 8-12). As with the recombinant enzyme 
(Figure 2G), Pol r| complemented WCE predominantly 
incorporated dATP and dGTP opposite the 3meC 
(Figure 6C, lanes 9 and 10). In the presence of 4 dNTPs, 
+ 1 product with electrophoretic mobility similar to the 
one of A incorporated opposite the lesion was obtained 
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(Figure 6C, compare lanes 8 and 10). This suggested that 
Pol n potentially bypasses 3meC in an inaccurate manner 
and further coincided with the results of reactions using 
only the purified protein (Figure 4G). Pol r\ mediated 
bypass in context of WCE was partially efficient, due to 
low amount of extension product generated (Figure 6C, 
lane 8). We thus hypothesized that the simultaneous 
increase in the amounts of (i) Pol i as inserter and (ii) 
Pol T) as both inserter and extender of 3meC could 
improve the bypass efficiency. Expression of both Pols 
in XPV cells (Figure 6B) resulted in promoted incorpor- 
ation of all 4 nucleotides (Figure 6C, lanes 21-24), as well 
as an increased amount of extension product (Figure 6C, 
compare lanes 8 and 20), overall leading to improved 
3meC bypass efficiency. Taken together these findings 



clearly indicated that the presence of functional Pol r\ in 
human cells is crucial for 3meC bypass, which can be 
mediated by either Pol r\ alone or as a two-step process 
involving an additional Pol, such as Pol v. 



DISCUSSION 

Exposure of genomic DNA to alkylating agents results in 
generation of numerous DNA lesions. While the pathways 
to repair alkylating DNA lesions are well characterized, 
much less is known about the influence of alkylation 
damage on different Pols. Several studies described the 
properties of prokaryotic Pols on the alkylation lesion 
3meC; however, it is currently unknown how this 
damage affects eukaryotic Pols. In this work, we 
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characterized the properties of 3meC bypass by the six 
human Pols 8, p, X, k, i and r\, members of the 
B-, X- and Y-families. Our findings strongly suggest that 
among the six tested enzymes, Poln most efficiently 
bypassed 3meC- On open or long gapped substrates, Pol 
r| -mediated 3meC bypass was error-prone, leading to the 
incorporation of almost all four nucleotides with selectiv- 
ity indexes ranging from 0.7 (for dATP) to 9 (for dTTP) 



with respect to the correct dGTP (Table 1). However, on 
single nucleotide gapped substrates, incorporation was re- 
stricted to dATP and dGTP (Figure 4G). Besides its 
ability to incorporate the incorrect dATP, as well as the 
correct dGTP opposite 3meC, Pol n efficiently extended 
both correctly and incorrectly paired primer termini on 
lesion DNA. Pol r\ was unique among all the tested Pols 
in combining the properties of both an inserter and 



564 Nucleic Acids Research, 2014, Vol. 42, No. 1 



extender. We further confirmed its importance in the 
context of a whole cell; by showing that lack of Pol r\ in 
human cell extracts results in a severely reduced ability to 
act at the site of 3meC (Figure 6C). The importance of 
functional Pol r\ in response to monoalkylating agents 
has been already suggested by in vivo experiments 
showing that the Saccharomyces cerevisiae rad30D strain 
lacking Pol n is somewhat sensitive to MMS (33,34). 
Furthermore, MMS treatment of human cells was 
shown to result in formation of Pol r\ foci at the site of 
replication forks blocked by alkylation lesions (35), one of 
which is potentially 3meC. In addition to Pol r\, another 
Y-family member, Pol i efficiently bypassed 3meC by pref- 
erentially misincorporating dTTP opposite the lesion but 
was unable to extend it. In the context of WCEs, an 
increase in the level of Pol i as inserter resulted in 
promoted Pol remediated 3meC bypass (Figure 6C). 
Interestingly, another Y-family enzyme, Pol k, was a 
poor inserter but proved to be the most efficient in extend- 
ing from a T:3meC mismatch. Although the generation of 
G:3meC pair by Pol r\ would create another chance for 
ALKBH2 to repair and thus remove lesion from the 
genome, it is currently not clear which consequences the 
persistence of A:3meC, as well as T:3meC, created by Pol 
r| and i respectively, would have on genome stability. If 
repair of A:3meC and T:3meC would be possible by 
ALKBH2 proteins, it would result in harmful mismatches 
and could potentially lead to mutations in the next round 
of DNA replication. Generation of mismatches would be 
especially detrimental in tissues lacking functional 
mismatch repair, as was observed for various tumour 
types (36,37). To prevent genomic instability in precancer- 
ous tissues, it would thus be crucial to activate a Pol that 
can rather faithful bypass 3meC, such as Pol r\. 

Under replicative conditions, 3meC impaired the 
activity of the B-family replicative Pol 8 and blocked the 



repair Pols p and X (Figure IB). A similar finding was seen 
upon 3meC extension, condition under which Pol 8 only 
very weakly generated full-length product. Thus suggest- 
ing that, upon encountering the lesion, the replication fork 
could potentially be stalled and for its successful continu- 
ation, another more specialized Pol, like Pol r\, would be 
required. These results correlate well with findings 
obtained in prokaryotes demonstrating impeded activity 
of Pols I and III at the site of 3meC (10,18,19). 
Interestingly, though Pol X was unable to directly incorp- 
orate nucleotides opposite the lesion, in the context of 
gapped DNA, larger than 1 nt, this Pol successfully 
scrunched the template and continued the synthesis by 
pairing the incoming nucleotide with the template base 
at +1 position from the lesion. The property of Pol X to 
scrunch the template was directly shown by the crystal 
structure published previously by Garcia-Diaz et al. (30). 
Also, when the terminal nucleotide of the primer was com- 
plementary to the nucleotide immediately downstream the 
lesion, Pol X was able to loop out 3meC and use the base 
downstream to re-align the primer. This slippage ability of 
Pol X has been already observed for other lesions (38). In 
both cases, the result was a 1-nt deletion. 

Based on our findings, we propose a model for 3meC 
bypass in context of DNA replication (Figure 7). When 
the lesion is not repaired by ALKBHs and remains present 
in the template strand, Pols might handle it in one of the 
following ways: (1) after copying 3meC by incorporating 
inaccurate dATP or dTTP, the replicative Pol 8 is stalled, 
and TLS Pol r\ is recruited to extend the mispair 
(Figure 7-1); (2) the replication fork is stalled in front 
of 3meC leading to Pol r\ recruitment, predominant inser- 
tion of dATP opposite the lesion and the extension 
(Figure 7-2); (3) the replication fork is stalled in front of 
3meC, Pol i is recruited to tolerate the lesion by 
misincorporating dTTP and Pol r\ to extend the newly 
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formed T:3meC base-pair (Figure 7-3). In summary, 
our findings suggest the potential existence of novel Pol 
r| -dependent pathway that would enable cells to overcome 
cytotoxic effects of alkylating agents, even at the cost 
of potentially causing mutations, especially in the 
absence of functional direct reversal proteins. 
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